CDS

Accession Number TCMCG011C07402
gbkey CDS
Protein Id XP_021891445.1
Location join(513499..513588,514895..515065,515315..515398,515598..515674,517902..517914,518321..518378,518477..518544,518629..518691,519662..519715,519822..519924,520008..520117,520223..520342,521635..521691,527167..527330,527483..527648,528089..528172,528335..528451,529063..529193,529267..529348,529690..529777,530268..530311,530389..530484,531692..531768,532563..532754,533224..533389,533737..533808,534068..534271,534366..534488)
Gene LOC110809818
GeneID 110809818
Organism Carica papaya

Protein

Length 957aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA264084
db_source XM_022035753.1
Definition L-arabinokinase-like [Carica papaya]

EGGNOG-MAPPER Annotation

COG_category G
Description Galactokinase galactose-binding signature
KEGG_TC -
KEGG_Module -
KEGG_Reaction R01754        [VIEW IN KEGG]
KEGG_rclass RC00002        [VIEW IN KEGG]
RC00078        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K12446        [VIEW IN KEGG]
EC 2.7.1.46        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00520        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
map00520        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGCGGAATCTAATGCGTCGAAGAAGTCTCTGGTCTTTGCTTACTACGTTACTGGCCATGGATTTGGTCACGCCACTCGTGTTGTTGAGGTTTTAGGGTTATCAATCTGTGCATTATGCTCATTAATCACAAACTTCATGTATCATGTTCGTGCAACTCCATTGACAAATTTTTTTTTTGTCCGATTACAGGTGTTACTAGACTGCGGAGCTGTTCAGGCAGATGCTTTGACTGTTGATCGCCTTGCCTCCTTGGAAAAGTATTCTCAGACTGCAGTAATACCACGAGATTCTATTTTGGCAACGGAGGTGGAATGGCTAAAGTCTATCAAAGCTGACCTTGTGGTTTCAGATGTTGTCCCCGTCGCATGTCGAGCGGCAGTAAATGCTGGAATTCTTTCTGTTTGCGTTACAAACTTTAGCAAAAAGTCTGAGATAGCTGAAGATTATTCCCACTGTGAATTTTTGATACGTCTACCGGGGTACTGTCCTATGCCTGCTTTTCGTGATGTTATTGATATACCTCTTGTTGTGAGGAGGTTACACAAATCTAGAGAAGAGGTGAGGAAAGAGCTTGGAGTTAAGGACAACATGAAGCTAGTAATTTTCAATTTTGGTGGTCAGCCAGCTGGATGGAATTTAAAGGAGGAGTATTTACCGGCTGGTTGGTTGTGCCTTGTCTGTGGTGCTTCAGAGAAGCAGCAGTTTCCCCCTAATTTCATCAAACTCCCAAAAGATGTCTATACCCCTGATCTGATTGCAGCTTCAGACTGCATGCTTGGAAAAATTGGGTATGGAACAGTTAGCGAAGCTCTTGCATATAAGTTGCCATTTGTGTTTGTGCGGAGAGATTACTTTAACGAAGAACCATTTTTGAGGAATATGCTTGAGTTCTACCAGAGTGGTGTTGAGATGATAAGGAGAGATCTGCTGACTGGATGCTGGAGACCCTACCTTGAGCGTGCCCTCTGCTTAAAACCATGCTACGATGGAGGCATCAATGGTGGTGAGGTGGCTGCTCAAATACTGCAAGATACAGCTTTAGGAAAAAAACATTCTTCAAATAATCTTAGTGGAGCAAGGAGATTGCGAGATGCCATAATTCTTGGGTTTCAACTGCAAAGAGCTCCTGGTAGAGATATATCTGTTCCAGAATGGTATAATATGGCAGAAACTGAACTTAGTCTTCGCTCTGCATTACCAACTGGTCAATTAACTCAAATAAGTTCTCAATGCATAGAAGGCTTTGAAATTCTTCATGGGGATCATCTGGGCCTTTCTGATACAGTTAGCTTCTTGAGTGGCTTGGAACAACTAGCTTCTGTATCTGAGTCATCTAAAAGTACCAAAAATCCAACTAGGGAGAATCTGGCTGCTGCTACACTATTCAACTGGGAGGAGGAAATCTTTGTGGCAAGGGCACCTGGGAGGCTAGATGTAATTGGAGGCATCGCAGACTATTCAGGAAGCCTTGTCTTGCTAATGCCTACAAAAGAAGCTTGCCATGTTGCTGTTCAGAGGAATCACCCAAGCAAGCAAAAGCTGTGGAAGCATGCTCAAGCTAGGCAGCATGCCAAAGAAACCCCTATTGTTGAAATTGTATCATTTGGGTCAGAATTAAGTAATCGTGGGCCAACATTTGATATGGACTTATCTGATTTCTTGGATGGTGAGAATCCAATATCCTATGAGAAGGCATGCAAATACTTTGCACAGGATCCTTCTCAAAGGTGGGCGGCATATGTTGCAGGGGTAATTCTTGTACTGATGAAAGAATTAGGGGTTTGCTTTCAGGACAGTATCAGCATACTGGTTTCCTCTGCAGTTCCAGAAGGCAAGGGGGTTTCTTCTTCGGCTGCATTGGAAGTGGCTACCTTGTCTGCTATTGCTGCTGCACATGGTTTGGACCTTGCTCCTAGAGATGTTGCTTTGCTTTGCCAGAAGGTGGAGAATCATGTTGTTGGAGCTCCATGTGGGGTGATGGACCAGATGGCTTCTGCATGTGGCGAAGCAAACAAACTTCTTGCAATGGTATGCCAGCCTGCTGAGGTGCTAGGAGTTGTCGAGATACCTGCCCATGTTCGATTTTGGGGGATTGATTCTGGAATAAGACACAGTGTCGGTGGTTCAGATTATGGTTCCGTGCGAATAGGGGCTTTCATGGGCAGGAAGATTATAAAATCGTTGGCTGCTGCATCATCATCAATCTCTTTGCCAGGAAATAACCCTGAAGAAATTGAGGAGGACAGCTTTGAGTTACTTGCAGAGGAGGAATCGTTGGATTACTTATGCAACCTCACACCTCATAGGTATGAAGCTCTATATGCAAATAAGATTCCAGAGTCCACTACCGGGGAAGAATTCATTAAGATGTATAAAGATCATAATGATTCAGTCACAACAATAGACCAAAAAGTCAGCTATGCAGTAAGAGCACCCACTGCGCATCCAATTTATGAGAACTTCCGTGTTAAGGCTTTTAAAGCATTGCTAACAGCTGCTACTTCCGAGGAACAACTCACTGCTCTTGGGGAACTAATGTATCAGTGTCATTACAGTTATAGTAAGTGCGGACTTGGCTCCGATGGAACAGATAGGCTTGTAAGATTAGTACAAAAAATGCAACACTCTAAGATCTCAAAATCCGAAAACGGTACACTGTATGGAGCAAAGATCACTGGCGGAGGCAGTGGTGGGACTGTTTGTGTGTTTGGGAAGAATTCCTTGAGAAGCAGTGAACAAATTCTTCAGATTCAGCAGAAATACAAAGATGACACTGGATTTATGCCATATGTCTTTGACGGTTCTTCACCGGGTGCTGGCAAGTTTGGATATCTAAAAATTCGTCGCCTCACCTATGCGAAATCTATTTAA
Protein:  
MAESNASKKSLVFAYYVTGHGFGHATRVVEVLGLSICALCSLITNFMYHVRATPLTNFFFVRLQVLLDCGAVQADALTVDRLASLEKYSQTAVIPRDSILATEVEWLKSIKADLVVSDVVPVACRAAVNAGILSVCVTNFSKKSEIAEDYSHCEFLIRLPGYCPMPAFRDVIDIPLVVRRLHKSREEVRKELGVKDNMKLVIFNFGGQPAGWNLKEEYLPAGWLCLVCGASEKQQFPPNFIKLPKDVYTPDLIAASDCMLGKIGYGTVSEALAYKLPFVFVRRDYFNEEPFLRNMLEFYQSGVEMIRRDLLTGCWRPYLERALCLKPCYDGGINGGEVAAQILQDTALGKKHSSNNLSGARRLRDAIILGFQLQRAPGRDISVPEWYNMAETELSLRSALPTGQLTQISSQCIEGFEILHGDHLGLSDTVSFLSGLEQLASVSESSKSTKNPTRENLAAATLFNWEEEIFVARAPGRLDVIGGIADYSGSLVLLMPTKEACHVAVQRNHPSKQKLWKHAQARQHAKETPIVEIVSFGSELSNRGPTFDMDLSDFLDGENPISYEKACKYFAQDPSQRWAAYVAGVILVLMKELGVCFQDSISILVSSAVPEGKGVSSSAALEVATLSAIAAAHGLDLAPRDVALLCQKVENHVVGAPCGVMDQMASACGEANKLLAMVCQPAEVLGVVEIPAHVRFWGIDSGIRHSVGGSDYGSVRIGAFMGRKIIKSLAAASSSISLPGNNPEEIEEDSFELLAEEESLDYLCNLTPHRYEALYANKIPESTTGEEFIKMYKDHNDSVTTIDQKVSYAVRAPTAHPIYENFRVKAFKALLTAATSEEQLTALGELMYQCHYSYSKCGLGSDGTDRLVRLVQKMQHSKISKSENGTLYGAKITGGGSGGTVCVFGKNSLRSSEQILQIQQKYKDDTGFMPYVFDGSSPGAGKFGYLKIRRLTYAKSI